RAW SEQUENCE LISTING

PATENT APPLICATION: US/10/522,341

DATE: 02/17/2005
TIME: 12:04:43

Input Set : A:\sequence listing.txt
Output Set: N:\CRF4\02172005\J522341.raw

3 <110> APPLICANT: Kock, Michael
4 Frank, Markus
5 Badur, Ralf
6 <120> TITLE OF INVENTION: Novel Selection Processes
8 <130> FILE REFERENCE: 532622010400
C--> 10 <140> CURRENT APPLICATION NUMBER: US/10/522,341
C--> 11 <141> CURRENT FILING DATE: 2005-01-25
13 <160> NUMBER OF SEQ ID NOS: 71
15 <170> SOFTWARE: PatentIn Ver. 2.1



ERRORED SEQUENCES



350 <210> SEQ ID NO: 4
351 <211> LENGTH: 427
352 <212> TYPE: PRT
353 <213> ORGANISM: Artificial sequence
W--> 354 <223> OTHER INFORMATION: Description of the artificial sequence: coding for
355 cytosine deaminase (codA)
E--> 357 <400> SEQUENCE: 4

358 Met Ser Asn Asn Ala Leu Gln Thr Ile Ile Asn Ala Arg Leu [illegible]
359 1 5 10 15

361 Glu Glu Gly Leu Trp Gln Ile His Leu Gln Asp Gly Lys Ile [illegible]
362 20 25 30

364 Ile Asp Ala Gln Ser Gly Val Met Pro Ile Thr Glu Asn Ser Leu Asp
365 35 40 45

367 Ala Glu Gln Gly Leu Val Ile Pro Pro Phe Val Glu Pro His Ile His
368 50 55 60

370 Leu Asp Thr Thr Gln Thr Ala Gly Gln Pro Asn Trp Asn Gln Ser Gly
371 65 70 75 80

373 Thr Leu Phe Glu Gly Ile Glu Arg Trp Ala Glu Arg Lys Ala Leu Leu
374 85 90 95

376 Thr His Asp Asp Val Lys Gln Arg Ala Trp Gln Thr Leu Lys Trp Gln
377 100 105 110

379 Ile Ala Asn Gly Ile Gln His Val Arg Thr His Val Asp Val Ser Asp
380 115 120 125

382 Ala Thr Leu Thr Ala Leu Lys Ala Met Leu Glu Val Lys Gln Glu Val
383 130 135 140

385 Ala Pro Trp Ile Asp Leu Gln Ile Val Ala Phe Pro Gln Glu Gly Ile
386 145 150 155 160

388 Leu Ser Tyr Pro Asn Gly Glu Ala Leu Leu Glu Glu Ala Leu Arg Leu
389 165 170 175




391 Gly Ala Asp Val Val Gly Ala Ile Pro His Phe Glu Phe Thr Arg Glu

392 180 185 190

394 Tyr Gly Val Glu Ser Leu His Lys Thr Phe Ala Leu Ala Gln Lys Tyr

395 195 200 205

397 Asp Arg Leu Ile Asp Val His Cys Asp Glu Ile Asp Asp Glu Gln Ser

398 210 215 220

400 Arg Phe Val Glu Thr Val Ala Ala Leu Ala His His Glu Gly Met Gly

401 225 230 235 240

403 Ala Arg Val Thr Ala Ser His Thr Thr Ala Met His Ser Tyr Asn Gly

404 245 250 255

406 Ala Tyr Thr Ser Arg Leu Phe Arg Leu Leu Lys Met Ser Gly Ile Asn

407 260 265 270

409 Phe Val Ala Asn Pro Leu Val Asn Ile His Leu Gln Gly Arg Phe Asp

410 275 280 285

412 Thr Tyr Pro Lys Arg Arg Gly Ile Thr Arg Val Lys Glu Met Leu Glu

413 290 295 300

415 Ser Gly Ile Asn Val Cys Phe Gly His Asp Asp Val Phe Asp Pro Trp

416 305 310 315 320

418 Tyr Pro Leu Gly Thr Ala Asn Met Leu Gln Val Leu His Met Gly Leu

419 325 330 335

421 His Val Cys Gln Leu Met Gly Tyr Gly Gln Ile Asn Asp Gly Leu Asn

422 340 345 350

424 Leu Ile Thr His His Ser Ala Arg Thr Leu Asn Leu Gln Asp Tyr Gly

425 355 360 365

427 Ile Ala Ala Gly Asn Ser Ala Asn Leu Ile Ile Leu Pro Ala Glu Asn

428 370 375 380

430 Gly Phe Asp Ala Leu Arg Arg Gln Val Pro Val Arg Tyr Ser Val Arg

431 385 390 395 400

433 Gly Gly Lys Val Ile Ala Ser Thr Gln Pro Ala Gln Thr Thr Val Tyr

434 405 410 415

436 Leu Glu Gln Pro Glu Ala Ile Asp Tyr Lys Arg

437 420 425




4798 <210> SEQ ID NO: 50

4799 <211> LENGTH: 27

4800 <212> TYPE: DNA

4801 <213> ORGANISM: Artificial sequence
W--> 4802 <220> FEATURE:

4803 <223> OTHER INFORMATION: Description of the artificial sequence:

4804 oligonucleotide primer
4806 <400> SEQUENCE: 50

E--> 4807 <sgfcgaatacg gcgtggagtjc g
4810 <210> SEQ ID NO: 51

4811 <211> LENGTH: 26

4812 <212> TYPE: DNA

4813 <213> ORGANISM: Artificial sequence 
W--> 4814 <220> FEATURE: 

4815 <223> OTHER INFORMATION: Description of the artificial sequence: 

4816 oligonucleotide primer 
4818 <400> SEQUENCE: 51 







4819 cggcaggata atcaggttgg 20

4842 <210> SEQ ID NO: 53

4843 <211> LENGTH: 27

4844 <212> TYPE: DNA

4845 <213> ORGANISM: Artificial sequence
W--> 4846 <220> FEATURE:

4847 <223> OTHER INFORMATION: Description of the artificial sequence:

4848 oligonucleotide primer
4850 <400> SEQUENCE: 53

E--> 4851 ^tcaacgtaa ccaaocctgc 

5506 <210> SEQ ID NO: 59 

5507 <211> LENGTH: 1011 

5508 <212> TYPE: DNA 

5509 <213> ORGANISM: Zea mays 
W--> 5510 <220> FEATURE: 

5511 <221> NAME/KEY: CDS 

5512 <222> LOCATION: (1)..(981)

5513 <223> OTHER INFORMATION: coding for 5-methylthioribose kinase

5515 <400> SEQUENCE: 59

5516 gca cga gca ctc ctc tcc tct cct ctc gcc ggc gca tcg ccc gac tgt 48

5517 Ala Arg Ala Leu Leu Ser Ser Pro Leu Ala Gly Ala Ser Pro Asp Cys

5518 1 5 10 15

5520 cag tca gcc tca gcc atg gcc gcg gag gag gag cag ggc ttc cgc ccg 96

5521 Gln Ser Ala Ser Ala Met Ala Ala Glu Glu Glu Gln Gly Phe Arg Pro

5522 20 25 30

5524 ctg gac gag tcg tcc ctg ctc gcc tac atc aag gcc acg ccg gcg ctc 144

5525 Leu Asp Glu Ser Ser Leu Leu Ala Tyr Ile Lys Ala Thr Pro Ala Leu

5526 35 40 45

5528 gcc tcc cgc ctc ggc ggc ggt ggc agt cta gac tcc atc gag atc aag 192

5529 Ala Ser Arg Leu Gly Gly Gly Gly Ser Leu Asp Ser Ile Glu Ile Lys

5530 50 55 60

5532 gag gtc ggc gac ggc aac ctc aac ttc gtc tac atc gtg cag tcc gag 240

5533 Glu Val Gly Asp Gly Asn Leu Asn Phe Val Tyr Ile Val Gln Ser Glu

5534 65 70 75 80

5536 gcc ggc gcc atc gtc gtc aag cag gcg ctc ccg tac gtg cgc tgc gtg 288

5537 Ala Gly Ala Ile Val Val Lys Gln Ala Leu Pro Tyr Val Arg Cys Val

5538 85 90 95

5540 ggg gat tcg tgg ccc atg acg cgg gag cgc gcc tac ttc gag gcc tcc 336

5541 Gly Asp Ser Trp Pro Met Thr Arg Glu Arg Ala Tyr Phe Glu Ala Ser

5542 100 105 110

5544 acg ctg cgg gag cac ggc cgc ctg tgc ccg gag cac acc ccc gag gtg 384

5545 Thr Leu Arg Glu His Gly Arg Leu Cys Pro Glu His Thr Pro Glu Val

5546 115 120 125

5548 tac cac ttc gac cgg acc ttg tcg ctg atg ggg atg cgc tac atc gag 432

5549 Tyr His Phe Asp Arg Thr Leu Ser Leu Met Gly Met Arg Tyr Ile Glu

5550 130 135 140

5552 ccc ccg cac atc atc ctc cgc aag ggc ctc gtc gcc ggt gtc gag tac 480

5553 Pro Pro His Ile Ile Leu Arg Lys Gly Leu Val Ala Gly Val Glu Tyr

5554 145 150 155 160
5556 ccg ctg ctc gcc gac cac atg tcc gat tac atg gcc aag acg ctc ttc 528

5557 Pro Leu Leu Ala Asp His Met Ser Asp Tyr Met Ala Lys Thr Leu Phe

5558 165 170 175

5560 ttc acc tcc ctc ctc tat aac aat acc acg gat cat aag aac gga gtt 576

5561 Phe Thr Ser Leu Leu Tyr Asn Asn Thr Thr Asp His Lys Asn Gly Val

5562 180 185 190

5564 gct aag tac tct gcg aac gtg gag atg tgt agg ctc acg gag caa gtt 624

5565 Ala Lys Tyr Ser Ala Asn Val Glu Met Cys Arg Leu Thr Glu Gln Val

5566 195 200 205

5568 gtg ttc tcg gac cca tac cgt gtt tcc aaa ttt aat cgg tgg acc tcg 672

5569 Val Phe Ser Asp Pro Tyr Arg Val Ser Lys Phe Asn Arg Trp Thr Ser

5570 210 215 220

5572 cct tat ctc gac aaa gat gct gag gca gtt cgc gag gat gat gag ctc 720

5573 Pro Tyr Leu Asp Lys Asp Ala Glu Ala Val Arg Glu Asp Asp Glu Leu

5574 225 230 235 240

5576 aag ttg gaa gta gct ggg ctg aaa tcg atg ttt atc gag aga gct caa 768

5577 Lys Leu Glu Val Ala Gly Leu Lys Ser Met Phe Ile Glu Arg Ala Gln

5578 245 250 255

5580 gct ctg att cat gga gat ctc cac act ggt tct atc atg gtg acc gaa 816

5581 Ala Leu Ile His Gly Asp Leu His Thr Gly Ser Ile Met Val Thr Glu

5582 260 265 270

5584 gtt caa ctc aag tca ttg atc cag aat ttg ggt tct atg ggg cca atg 864

5585 Val Gln Leu Lys Ser Leu Ile Gln Asn Leu Gly Ser Met Gly Pro Met

5586 275 280 285

5588 ggg ttt gat att ggg agc ctt cct tgg aaa cct gat ttt ggg cat act 912

5589 Gly Phe Asp Ile Gly Ser Leu Pro Trp Lys Pro Asp Phe Gly His Thr

5590 290 295 300

5592 atg cac aga atg ggc atg ctg atc aag cga atg atc gta agg ctt aca 960

5593 Met His Arg Met Gly Met Leu Ile Lys Arg Met Ile Val Arg Leu Thr

5594 305 310 315 320
5596 aga atg gat ctt gaa gac aat tgaagagtcg tggaatttgt tccacaaa^ 1011

E--> 5597 arg met asp Leu glu asp asn
<210> 55
<211> 5674 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: vector 
construct pBluKS-nitP-STLS1-35S-T

ccagcttttg ttccctttag tgagggttaa tttcgagctt ggcgtaatca tggtcatagc 60 
tatttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 120 
taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 180 
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 240 
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 300
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 360 
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 420
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 480 
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 540 
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 600 
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 660 
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 720
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 780
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 840
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag 900 
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 960 
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 1020 
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 1080
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 1140
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 1200
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 1260 
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 1320 
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg g^tccagatt 1380 
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 1440 
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 1500 
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 1560 
gta^ggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 1620 
?gtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 1680 
.cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 1740 
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 1800 
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 1860 
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 1920 
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 1980 
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 2040 
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 2100
gcattta.ca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 2160 
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgcg ccctgtagcg 2220 
gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg 2280 
ccctagcgcc cgctcctttc gctttcttcc cttcctttct cgccacgttc gccggctttc 2340 
cccgtcaagc tctaaatcgg gggctccctt tagggttccg atttagtgct ttacggcacc 2400
tcgaccccaa aaaacttgat tagggtgatg gttcacgtag tgggccatcg ccctgataga 2460 
cggtttttcg ccctttgacg ttggagtcca cgttctttaa tagtggactc ttgttccaaa 2520 
c?ggaacaac actcaaccct atctcggtct attcttttga tttataaggg attttgccga 2580 
ttSggccta ttggttaaaa aatgagctga tttaacaaaa atttaacgcg aattttaaca 2640 
aaatattaac gcttacaatt tccattcgcc attcaggctg cgcaactgtt gggaagggcg 2700 
atcggtgcgg gcctcttcgc tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg 2760 
attaag?tgg gtaacgccag ggttttccca gtcacgacgt tgtaaaacga cggccagtga 2820 
attgtaatac gactcactat agggcgaatt ggagctcgtc gagaccagat gttttacact 2880 




tgaccgtaaa tgagcacccg aagaaaccgg tcacattcat ttcgaaggtg gagaaagcgg 2940
aagatgactc aaacaagtaa tcggttgtga ttcgtcagtt catgtcactc ctatgaagga 3000
gtcaagttca aaatgttatg ttgagtttca aacttttatg ctaaactttt tttctttatt 3060
ttcgttaata atggaagaga accaattctc ttgtatctaa agattatcca tctatcatcc 3120
aatttgagtg ttcaattctg gatgttgtgt taccctacaf tctacaacca tgtagccaat 3180
tattatgaat ctggctttga tttcagttgt gttcttttct tttttttctt tgcatatttg 3240
catttagaat gtttaataat taagttactg tatttccaca tacattagtt ccaagaatat 3300
acatatatta atttattttt cttaaaaatg ttttggaatg actaatattg acaacgaaaa 3360
tagaagctat gctaaaccat tacgtatatg tgacttcaca tgttgttgtt ttacattccc 3420
tatatatatg gatggctgtc acaatcagaa acgtgatcga aaaaagacaa acagtgtttg 3480
cataaaaaga ctatttcgtt tcattgacaa tttgtgttta tttgtaaaga aaagtggcaa 3540
agtggaattt gagttcctgc aagtaagaaa gatgaaataa aagacttgag tgtgtgtttt 3600
tttcttttat ctgaaagctg caatgaaata ttcctaccaa gcccgtttga ttattaattg 3660
gggtttggtt ttcttgatgc gaactaattg gttatataag aaactataca atccatgtta 3720
attcaaaaat tttgatttct cttgtaggaa tatgatttac tatatgagac tttcttttcg 3780
ccaataatag taaatccaaa gatatttgac cggaccaaaa cacattgatc tattttttag 3840
tttatttaat ccagtttctc tgagataatt cattaaggaa aacttagtat taacccatcc 3900
taagattaaa taggagccaa actcacattt caaatattaa ataacataaa atggatttaa 3960
aaaatctata cgtcaaattt tatttatgac atttcttatt taaatttata tttaatgaaa 4020
tacagctaag acaaaccaaa aaaaaaatac tttctaagtg gtccaaaaca tcaattccgt 4080
tcaatattat taggtagaat cgtacgacca aaaaaaggta ggttaatacg aattagaaac 4140
atatctataa catagtatat attattacct attatgagga atcaaaatgc atcaaatatg 4200
gatttaagga atccataaaa gaataaattc tacgggaaaa aaaatggaat aaattctttt 4260
aagtttttta tttgtttttt atttggtagt tctccatttt gttttatttc gtttggatft 4320
attgtgtcca aatactttgt aaaccaccgt tgtaattctt aaacggggtt ttcacttctt 4380
ttttatattc agacataaag catcggctgg tttaatcaat caatagattt tatttttctt 4440
ctcaattatt agtaggtttg atgtgaactt tacaaaaaaa acaaaaacaa atcaatgcag 4500
agaaaagaaa ccacgtgggc tagtcccacc ttgtttcatt tccaccacag gttcgatctt 4560
cgttaccgtc tccaatagga aaataaacgt gaccacaaaa aaaaaacaaa aaaaagtcta 4620
tatattgctt ctctcaagtc tctgagtgtc atgaaccaaa gtaaaaaaca aagactcgac 4680
ctgcaggcat gcaagcttat cgtcgactac gtaagtttct gcttctacct ttgatatata 4740
tataataatt atcattaatt agtagtaata taatatttca aatatttttt tcaaaataaa 4800
agaatgtagt atatagcaat tgcttttctg tagtttataa gtgtgtatat tttaatttat 4860
aacttttcta atatatgacc aaaatttgtt gatgtgcagg tatcaccgga tccatcgaat 4920
tcggtacgct gaaatcacca gtctctctct acaaatctat ctctctctat tttctccaia 4980
aataatgtgt gagtagtttc ccgataaggg gaa^tag tctt ttcgctca 5040
tgtgttgagc atataagaaa cccttagtat gtatttgt ftaaaata cttctatcaa 5100
taaaatttct aattcctaaa accaaaatcc agtactaaaa tccagatctc ctaaagtccc 5160
tatagatctt tgtcgtgaat ataaaccaga cacgagacga ctaaacctgg agcccagacg 5220
ccgttcgaag ctagaagtac cgcttaggca ggaggccgtt agggaaaaga tgctaaggca 5280
gggttggtta cgttgactcc cccgtaggtt tggtttaaat atgatgaagt ggacggaagg 5340
aaggaggaag acaaggaagg ataaggttgc aggccctgtg caaggtaaga agatggaaat 5400
ttgatagagg tacgctacta tacttatact atacgctaag ggaatgcttg tatttatacc 5460
ctataccccc taataacccc ttatcaattt aagaaataat ccgcataagc ccccgcttaa 5520
aaattggtat cagagccatg aataggtcta tgaccaaaac tcaagaggat aaaacctcac 5580
caaaatacga aagagttctt aactctaaag ataaaagatc tttcaagatc aaaactagtt 5640
ccctcacacc ggtgacgggg atcgcgatgg gtac 5674
VARIABLE LOCATION SUMMARY DATE: 02/17/2005

PATENT APPLICATION: US/10/522,341 TIME: 12:04:45

Input Set : A:\sequence listing.txt

Output Set: N:\CRF4\02172005\J522341.raw

Use of n's or Xaa's (NEW RULES):

Use of n's and/or Xaa's have been detected in the Sequence Listing.
Use of <220> to <223> is MANDATORY if n's or Xaa's are present.
In <220> to <223> section, please explain location of n or Xaa, and which
residue n or Xaa represents.

Seq#:55; N Pos. 5014
Seq#:57; N Pos. 6697
Seq#:59; N Pos. 1026
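
The position report above can be illustrated with a minimal Python sketch. It is not the USPTO checker; the function names `find_n_positions` and `needs_feature_explanation` are hypothetical, and the sketch assumes a nucleotide sequence given as plain text in which whitespace and position numbers should be ignored.

```python
def find_n_positions(sequence_text: str) -> list:
    """Return the 1-based positions of 'n' in a nucleotide sequence.

    Whitespace, digits, and other non-letter characters are ignored,
    so a listing row such as "acgtn acgnn 10" can be passed as-is.
    """
    bases = [ch for ch in sequence_text.lower() if ch.isalpha()]
    return [pos for pos, base in enumerate(bases, start=1) if base == "n"]


def needs_feature_explanation(sequence_text: str, has_220_to_223: bool) -> bool:
    """Hypothetical rule check: flag a sequence that contains 'n'
    but has no <220> to <223> feature section explaining it."""
    return bool(find_n_positions(sequence_text)) and not has_220_to_223


if __name__ == "__main__":
    row = "acgtn acgnn 10"
    print(find_n_positions(row))
    print(needs_feature_explanation(row, has_220_to_223=False))
```

A sequence flagged this way would need a <220> to <223> entry stating where each n occurs and which residue it stands for.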
VERIFICATION SUMMARY DATE: 02/17/2005

PATENT APPLICATION: US/10/522,341 TIME: 12:04:45

Input Set : A:\sequence listing.txt
Output Set: N:\CRF4\02172005\J522341.raw

L:10 M:270 C: Current Application Number differs, Replaced Current Application Number
L:11 M:271 C: Current Filing Date differs, Replaced Current Filing Date
L:228 M:283 W: Missing Blank Line separator, <220> field identifier
L:231 M:283 W: Missing Blank Line separator, <220> field identifier
L:236 M:283 W: Missing Blank Line separator, <220> field identifier
L:354 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:4
L:357 M:200 E: Mandatory Header Field missing, <220> Tag not found for SEQ ID#:4
L:443 M:283 W: Missing Blank Line separator, <220> field identifier
L:641 M:283 W: Missing Blank Line separator, <220> field identifier
L:645 M:283 W: Missing Blank Line separator, <400> field identifier
L:866 M:283 W: Missing Blank Line separator, <220> field identifier
L:1092 M:283 W: Missing Blank Line separator, <220> field identifier
L:1199 M:283 W: Missing Blank Line separator, <220> field identifier
L:1384 M:283 W: Missing Blank Line separator, <220> field identifier
L:1568 M:283 W: Missing Blank Line separator, <220> field identifier
L:1713 M:283 W: Missing Blank Line separator, <220> field identifier
L:1802 M:283 W: Missing Blank Line separator, <220> field identifier
L:1891 M:283 W: Missing Blank Line separator, <220> field identifier
L:2014 M:283 W: Missing Blank Line separator, <220> field identifier
L:2260 M:283 W: Missing Blank Line separator, <220> field identifier
L:2606 M:283 W: Missing Blank Line separator, <220> field identifier
L:2834 M:283 W: Missing Blank Line separator, <220> field identifier
L:3185 M:283 W: Missing Blank Line separator, <220> field identifier
L:3413 M:283 W: Missing Blank Line separator, <220> field identifier
L:3642 M:283 W: Missing Blank Line separator, <220> field identifier
L:3754 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:37
L:3847 M:283 W: Missing Blank Line separator, <220> field identifier
L:4039 M:283 W: Missing Blank Line separator, <220> field identifier
L:4225 M:283 W: Missing Blank Line separator, <220> field identifier
L:4411 M:283 W: Missing Blank Line separator, <220> field identifier
L:4597 M:283 W: Missing Blank Line separator, <220> field identifier
L:4782 M:283 W: Missing Blank Line separator, <220> field identifier
L:4802 M:283 W: Missing Blank Line separator, <220> field identifier
L:4807 M:252 E: No. of Seq. differs, <211> LENGTH:Input:27 Found:21 SEQ:50
L:4814 M:283 W: Missing Blank Line separator, <220> field identifier
L:4819 M:252 E: No. of Seq. differs, <211> LENGTH:Input:26 Found:20 SEQ:51
L:4826 M:283 W: Missing Blank Line separator, <220> field identifier
L:4846 M:283 W: Missing Blank Line separator, <220> field identifier
L:4851 M:252 E: No. of Seq. differs, <211> LENGTH:Input:27 Found:20 SEQ:53
L:4857 M:283 W: Missing Blank Line separator, <220> field identifier
L:4869 M:283 W: Missing Blank Line separator, <220> field identifier
L:4957 M:258 W: Mandatory Feature missing, <221> Tag not found for SEQ ID#:55
L:4957 M:258 W: Mandatory Feature missing, <222> Tag not found for SEQ ID#:55
L:4957 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:55 after pos.:4980
L:4975 M:283 W: Missing Blank Line separator, <220> field identifier
L:5087 M:283 W: Missing Blank Line separator, <220> field identifier
L:5203 M:258 W: Mandatory Feature missing, <221> Tag not found for SEQ ID#:57
L:5203 M:258 W: Mandatory Feature missing, <222> Tag not found for SEQ ID#:57
L:5203 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:57 after pos.:6660
L:5262 M:283 W: Missing Blank Line separator, <220> field identifier
L:5510 M:283 W: Missing Blank Line separator, <220> field identifier
L:5597 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:59 after pos.:1011
L:5597 M:254 E: No. of Bases conflict, LENGTH:Input:325 Counted:1032 SEQ:59
L:5597 M:320 E: (1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS:6
L:5597 M:112 C: (48) String data converted to lower case,
L:5597 M:252 E: No. of Seq. differs, <211> LENGTH:Input:1011 Found:1032 SEQ:59
L:5673 M:283 W: Missing Blank Line separator, <220> field identifier
L:5761 M:283 W: Missing Blank Line separator, <220> field identifier
L:5842 M:283 W: Missing Blank Line separator, <220> field identifier
L:5923 M:283 W: Missing Blank Line separator, <220> field identifier
L:6003 M:283 W: Missing Blank Line separator, <220> field identifier
L:6015 M:283 W: Missing Blank Line separator, <220> field identifier
L:6027 M:283 W: Missing Blank Line separator, <220> field identifier
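
The M:252 entries in this summary compare the length declared under <211> with the number of residues actually found. A minimal Python sketch of such a comparison follows. It is an illustration only, not the verification software itself; `count_residues` and `check_length` are hypothetical names, and the message format simply imitates the entries in this summary.

```python
import re
from typing import Optional


def count_residues(sequence_text: str) -> int:
    """Count letters in a sequence row, ignoring spaces and position numbers."""
    return len(re.findall(r"[a-z]", sequence_text.lower()))


def check_length(declared: int, sequence_text: str, seq_id: int) -> Optional[str]:
    """Return an M:252-style message when the declared <211> length
    differs from the counted length, otherwise None."""
    found = count_residues(sequence_text)
    if found != declared:
        return (
            f"No. of Seq. differs, <211> LENGTH:Input:{declared} "
            f"Found:{found} SEQ:{seq_id}"
        )
    return None


if __name__ == "__main__":
    # A 20-base primer checked against a declared length of 26.
    print(check_length(26, "cggcaggata atcaggttgg 20", 51))
```

With a declared length of 26 and a 20-base primer, the sketch reports a mismatch of the same shape as the SEQ:51 entry above; when the two values agree it returns `None`.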